Abstract

We study a multi-marginal optimal transportation problem. Under certain conditions on the cost function and the first marginal, we prove that the solution to the relaxed, Kantorovich version of the problem induces a solution to the Monge problem and that the solutions to both problems are unique.

1 Introduction

Given two Borel probability measures $\mu_1$ and $\mu_2$ on spaces $M_1$ and $M_2$ respectively and a cost function $c : M_1 \times M_2 \to \mathbb{R}$, Monge's optimal transportation problem asks how to most efficiently map $\mu_1$ onto $\mu_2$, where efficiency is measured relative to $c$. We say that a function $G : M_1 \to M_2$ pushes $\mu_1$ forward to $\mu_2$ and write $G_\#\mu_1 = \mu_2$ if $\mu_1(G^{-1}(B)) = \mu_2(B)$ for all measurable $B \subseteq M_2$. Monge's problem is then to minimize $\int_{M_1} c(x_1, G(x_1))\,d\mu_1$ among all $G$ that push $\mu_1$ forward to $\mu_2$. Due to both its deep connections to other areas of mathematics and its applicability in other fields, optimal transportation has grown into a thriving field of research over the past 20 years; we refer the interested reader to the books by Villani for references.

In this paper, we are interested in a multi-marginal version of the preceding problem. Given Borel probability measures $\mu_i$ on $n$-dimensional, smooth manifolds $M_i$, for $i = 1, 2, ..., m$, and a cost function $c : M_1 \times M_2 \times ... \times M_m \to \mathbb{R}$, the multi-marginal version of Monge's optimal transportation problem is to minimize:

$$C(G_2, G_3, ..., G_m) := \int_{M_1} c(x_1, G_2(x_1), G_3(x_1), ..., G_m(x_1))\,d\mu_1 \qquad \text{(M)}$$

among all $(m-1)$-tuples of measurable maps $(G_2, G_3, ..., G_m)$, where $G_i : M_1 \to M_i$ pushes $\mu_1$ forward to $\mu_i$ for all $i = 2, 3, ..., m$. The Kantorovich formulation of the problem is to minimize:

$$C(\mu) := \int_{M_1 \times M_2 \times ... \times M_m} c(x_1, x_2, x_3, ..., x_m)\,d\mu \qquad \text{(K)}$$

among all positive Borel measures $\mu$ on $M_1 \times M_2 \times ... \times M_m$ such that the canonical projection

$$\pi_i : M_1 \times M_2 \times ... \times M_m \to M_i$$

pushes $\mu$ forward to $\mu_i$ for all $i$. For any $(m-1)$-tuple $(G_2, G_3, ..., G_m)$ such that $G_{i\#}\mu_1 = \mu_i$ for all $i = 2, 3, ..., m$, we can define the measure $\mu = (\mathrm{Id}, G_2, G_3, ..., G_m)_\#\mu_1$ on $M_1 \times M_2 \times ... \times M_m$, where $\mathrm{Id} : M_1 \to M_1$ is the identity map. Then $\mu$ projects to $\mu_i$ for all $i$ and $C(G_2, G_3, ..., G_m) = C(\mu)$; therefore, K can be interpreted as a relaxed version of M. Roughly speaking, the difference between the two formulations is that in M almost every point $x_1 \in M_1$ is coupled with exactly one point $x_i \in M_i$ for each $i = 2, 3, ..., m$, whereas in K an element of mass at $x_1$ is allowed to be split between two or more target points in $M_i$ for $i = 2, 3, ..., m$.
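The relation between M and K can be checked on a small discrete example. The following is a minimal sketch, assuming finite spaces whose points are represented by array indices; the helper names (`induced_plan`, `marginal`, `monge_cost`, `kantorovich_cost`) and the random cost are illustrative assumptions, not part of the paper. It builds $\mu = (\mathrm{Id}, G_2, G_3)_\#\mu_1$ and compares the two cost functionals.

```python
import numpy as np

def induced_plan(mu1, maps, sizes):
    """Return mu = (Id, G_2, ..., G_m)_# mu1 as an array on the product grid.

    mu1   : 1-D array of masses on M_1 (indices stand for points).
    maps  : list of integer arrays; maps[k][x1] plays the role of G_{k+2}(x1).
    sizes : tuple with the number of points in each M_i.
    """
    plan = np.zeros(sizes)
    for x1, mass in enumerate(mu1):
        plan[(x1,) + tuple(int(G[x1]) for G in maps)] += mass
    return plan

def marginal(plan, i):
    """Push the plan forward under the i-th canonical projection (0-based i)."""
    other_axes = tuple(a for a in range(plan.ndim) if a != i)
    return plan.sum(axis=other_axes)

def monge_cost(cost, mu1, maps):
    """Discrete analogue of the functional in (M)."""
    return sum(mass * cost[(x1,) + tuple(int(G[x1]) for G in maps)]
               for x1, mass in enumerate(mu1))

def kantorovich_cost(cost, plan):
    """Discrete analogue of the functional in (K)."""
    return float((cost * plan).sum())

# Hypothetical data: three marginals, three points each.
rng = np.random.default_rng(0)
sizes = (3, 3, 3)
cost = rng.random(sizes)
mu1 = np.array([0.5, 0.3, 0.2])
G2 = np.array([1, 2, 0])
G3 = np.array([2, 2, 1])  # need not be injective
plan = induced_plan(mu1, [G2, G3], sizes)
```

In this sketch the marginals of `plan` are the push-forwards of `mu1` under `G2` and `G3`, and the two cost functionals agree, which is the discrete counterpart of $C(G_2, ..., G_m) = C(\mu)$.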

Assuming that $c$ is continuous, it is not hard to show that a solution to K exists. When $m = 2$, under a regularity condition on $\mu_1$ and a twist condition on $c$, which we will define in the next section, one can show that this solution is concentrated on the graph of a function over $x_1$. It is then straightforward to show that this function solves M and to establish uniqueness results for both M and K. When $m \geq 3$, however, existence and uniqueness in M as well as uniqueness in K are still largely open. In their seminal paper, Gangbo and Święch [9] used a duality theorem of Kellerer [11] to resolve these questions for the quadratic cost function, $c(x_1, x_2, x_3, ..., x_m) = -\left|\sum_{i=1}^m x_i\right|^2$ on $\mathbb{R}^n \times \mathbb{R}^n \times ... \times \mathbb{R}^n$, extending partial results for the same cost by Olkin and Rachev [17], Knott and Smith [13] and Rüschendorf and Uckelmann [19]. Gangbo and Święch's theorem was then reproved using a different argument by Rüschendorf and Uckelmann [20] and generalized by Heinich [10] to cost functions of the form $c(x_1, x_2, x_3, ..., x_m) = h(\sum_{i=1}^m x_i)$ where $h : \mathbb{R}^n \to \mathbb{R}$ is strictly concave. Carlier extended these results to a still wider class of cost functions, but only in the case when all the domains $M_i$ are one dimensional [3]. A result with a somewhat different flavour was obtained by Carlier and Nazaret [5]. Assuming $n = m$ and $M_i = \mathbb{R}^n$ for all $i$, their objective was to maximize a convex function of the determinant of the matrix whose columns are the vectors $x_i$; for this cost function, they demonstrate that the maximizer may not be concentrated on the graph of a function over $x_1$ and may not be unique. The aim of the present article is to identify general conditions on $c$ under which both K and M admit unique solutions.

With one exception, the conditions we impose will look similar to standard conditions which arise when studying the two marginal problem. Our lone novel hypothesis is that a certain covariant 2-tensor on the product space $M_2 \times M_3 \times ... \times M_{m-1}$ should be negative definite. Tensors have recently become very relevant to optimal transportation, owing to their essential role in the study of the regularity of optimal maps. Ma, Trudinger and Wang [15] showed that a certain tensorial condition dictates the regularity of the solution to M in the two marginal case, and Kim and McCann [12] reinterpreted this condition, relating it to the sectional curvature of a certain pseudo-metric. In [18], the present author showed that the dimension of the support of the optimizers for multi-marginal problems is related to a family of symmetric, bi-linear forms. One surprising consequence of that work is a class of counterexamples demonstrating that the obvious generalization of the twist condition to the multi-marginal setting is sufficient neither to guarantee uniqueness of minimizers for K nor to ensure that the solution to K induces a solution to M. In these examples, the solutions were concentrated on submanifolds of dimension greater than $n$; motivated in part by this observation, [18] identified local conditions on $c$ under which the support of the optimal measure is at most $n$-dimensional. Our condition here is a little different. Whereas the question about the dimension of the support of a solution $\mu$ to K is purely local, showing that $\mu$ gives rise to a solution to M is a global issue: for almost all $x_1 \in M_1$ we must show that there is exactly one $(x_2, x_3, ..., x_m) \in M_2 \times M_3 \times ... \times M_m$ which gets coupled to $x_1$ by $\mu$. Our tensor here is designed to capture this global nature of the problem.

In the next section we recall relevant concepts from the theory of optimal 
transportation and formulate the conditions we will need. In section 3 we 
state and prove our main result and in the fourth section we exhibit several 
examples of cost functions which satisfy the criteria of our main theorem. 

Acknowledgment: It is a pleasure to thank my advisor, Robert Mc- 
Cann, for his support and for many useful discussions throughout the dura- 
tion of this work. 

2 Preliminaries and definitions 

We will assume that each $M_i$ can be smoothly embedded in some larger manifold in which its closure $\overline{M_i}$ is compact and that the cost $c \in C^2(\overline{M_1} \times \overline{M_2} \times ... \times \overline{M_m})$. In addition, we will assume that $M_i$ is a Riemannian manifold for $i = 2, 3, ..., m-1$ and that any two points can be joined by a smooth, length minimizing geodesic¹, although no such assumptions will be needed on $M_1$ or $M_m$. The requirement of a Riemannian structure is related to the global nature of M that we alluded to in the introduction; a Riemannian metric gives us a natural way to connect any pair of points, namely geodesics.

We will denote by $D_{x_i}c(x_1, x_2, ..., x_m)$ the differential of $c$ with respect to $x_i$. For $i \neq j$, the bi-linear form $D^2_{x_ix_j}c(x_1, x_2, ..., x_m)$ on $T_{x_i}M_i \times T_{x_j}M_j$ was introduced in [12]; in local coordinates, it is defined by

$$D^2_{x_ix_j}c\left\langle \frac{\partial}{\partial x_i^{\alpha_i}}, \frac{\partial}{\partial x_j^{\alpha_j}} \right\rangle = \frac{\partial^2 c}{\partial x_i^{\alpha_i}\,\partial x_j^{\alpha_j}}.$$

As $M_i$ is Riemannian for $i = 2, ..., m-1$, Hessians or unmixed, second order partial derivatives with respect to these coordinates make sense and we will denote them by $\mathrm{Hess}_{x_i}c(x_1, x_2, ..., x_m)$; note, however, that no Riemannian structure is necessary to ensure the tensoriality of the mixed second order partials $D^2_{x_ix_j}c(x_1, x_2, ..., x_m)$, as was observed in [12].

Given a Borel probability measure $\mu$ on a topological space $M$, the support of $\mu$, which we will denote by $\mathrm{spt}(\mu)$, is defined to be the smallest closed subset of $M$ such that $\mu(\mathrm{spt}(\mu)) = 1$.

¹ Note that we do not assume $M_i$ is complete, however, as we do not wish to exclude, for example, bounded, convex domains in $\mathbb{R}^n$.



The dual problem to K is to maximize

$$\sum_{i=1}^m \int_{M_i} u_i(x_i)\,d\mu_i \qquad \text{(D)}$$

among all $m$-tuples $(u_1, u_2, ..., u_m)$ of functions $u_i \in L^1(\mu_i)$ for which $\sum_{i=1}^m u_i(x_i) \leq c(x_1, ..., x_m)$ for all $(x_1, ..., x_m) \in M_1 \times M_2 \times ... \times M_m$.

There is a special class of functions satisfying the constraint in D that 
will be of particular interest to us: 

Definition 2.1. We say that an $m$-tuple of functions $(u_1, u_2, ..., u_m)$ is $c$-conjugate if for all $i$

$$u_i(x_i) = \inf_{x_j \in M_j,\ j \neq i} \Big( c(x_1, x_2, ..., x_m) - \sum_{j \neq i} u_j(x_j) \Big).$$

Whenever $(u_1, u_2, ..., u_m)$ is $c$-conjugate, the $u_i$ are semi-concave and hence have super differentials $\partial u_i(x_i)$ at each point $x_i \in M_i$. By compactness, for each $x_i \in M_i$ we can find $x_j \in \overline{M_j}$ for all $j \neq i$ such that $u_i(x_i) = c(x_1, x_2, ..., x_m) - \sum_{j \neq i} u_j(x_j)$; furthermore, as long as $|u_i(x_i)| < \infty$ for at least one $x_i$, $u_i$ is locally Lipschitz [16].

The following theorem makes explicit the link between the Kantorovich 
problem and its dual. 

Theorem 2.1. There exists a solution $\mu$ to the Kantorovich problem and a $c$-conjugate solution $(u_1, u_2, ..., u_m)$ to its dual. Furthermore, the maximum value in D coincides with the minimum value in K. Finally, for any solution $\mu$ to K, any $c$-conjugate solution $(u_1, u_2, ..., u_m)$ to D and any $(x_1, ..., x_m) \in \mathrm{spt}(\mu)$ we have $\sum_{i=1}^m u_i(x_i) = c(x_1, ..., x_m)$.

This result is well known in the two marginal case; for $m \geq 3$, the existence of solutions to K and D as well as the equality of their extremal values was proved in [11]. The remaining conclusions were proved for a special cost by Gangbo and Święch [9] and for a general, continuous cost when each $M_i = \mathbb{R}^n$ by Carlier and Nazaret [5]. The same proof applies for more general spaces $M_i$; we reproduce it below in the interest of completeness.

Proof. As mentioned above, a proof of the existence of solutions $\mu$ to K and $(v_1, v_2, ..., v_m)$ to D as well as the equality:

$$\sum_{i=1}^m \int_{M_i} v_i(x_i)\,d\mu_i = \int_{M_1 \times M_2 \times ... \times M_m} c(x_1, x_2, x_3, ..., x_m)\,d\mu \qquad (1)$$

can be found in [11]. We use a convexification trick, also found in [9] and [5], to build a $c$-conjugate solution to D. Define

$$u_1(x_1) = \inf_{x_j \in M_j,\ j \geq 2} \Big( c(x_1, x_2, ..., x_m) - \sum_{j=2}^m v_j(x_j) \Big)$$

and $u_i$ inductively by

$$u_i(x_i) = \inf_{x_j \in M_j,\ j \neq i} \Big( c(x_1, x_2, ..., x_m) - \sum_{j=1}^{i-1} u_j(x_j) - \sum_{j=i+1}^m v_j(x_j) \Big).$$

As

$$u_m(x_m) = \inf_{x_j \in M_j,\ j \neq m} \Big( c(x_1, x_2, ..., x_m) - \sum_{j=1}^{m-1} u_j(x_j) \Big),$$

we immediately obtain

$$u_i(x_i) \leq \inf_{x_j \in M_j,\ j \neq i} \Big( c(x_1, x_2, ..., x_m) - \sum_{j \neq i} u_j(x_j) \Big). \qquad (2)$$

The definition of $u_{i-1}$ implies that for all $(x_1, x_2, ..., x_m)$

$$v_i(x_i) \leq c(x_1, x_2, ..., x_m) - \sum_{j=1}^{i-1} u_j(x_j) - \sum_{j=i+1}^m v_j(x_j).$$

Therefore, $v_i(x_i) \leq u_i(x_i)$. It then follows that

$$u_i(x_i) = \inf_{x_j \in M_j,\ j \neq i} \Big( c(x_1, x_2, ..., x_m) - \sum_{j=1}^{i-1} u_j(x_j) - \sum_{j=i+1}^m v_j(x_j) \Big) \geq \inf_{x_j \in M_j,\ j \neq i} \Big( c(x_1, x_2, ..., x_m) - \sum_{j \neq i} u_j(x_j) \Big),$$

which, together with (2), implies that $(u_1, u_2, ..., u_m)$ is $c$-conjugate. Now, we have

$$\sum_{i=1}^m \int_{M_i} v_i(x_i)\,d\mu_i \leq \sum_{i=1}^m \int_{M_i} u_i(x_i)\,d\mu_i = \int_{M_1 \times M_2 \times ... \times M_m} \sum_{i=1}^m u_i(x_i)\,d\mu \leq \int_{M_1 \times M_2 \times ... \times M_m} c(x_1, x_2, x_3, ..., x_m)\,d\mu$$

and so by (1) we must have

$$\sum_{i=1}^m \int_{M_i} u_i(x_i)\,d\mu_i = \int_{M_1 \times M_2 \times ... \times M_m} \sum_{i=1}^m u_i(x_i)\,d\mu = \int_{M_1 \times M_2 \times ... \times M_m} c(x_1, x_2, x_3, ..., x_m)\,d\mu.$$

But because $\sum_{i=1}^m u_i(x_i) \leq c(x_1, x_2, x_3, ..., x_m)$, we must have equality $\mu$ almost everywhere. Continuity then implies equality holds on $\mathrm{spt}(\mu)$. □
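The convexification trick in the proof can be tried out on finite spaces, where every infimum is a minimum over array entries. The following is a minimal sketch under that assumption; the function names (`outer_sum`, `conjugate_component`, `convexify`) and the random data are hypothetical and only illustrate the construction of $(u_1, ..., u_m)$ from a feasible $(v_1, ..., v_m)$.

```python
import numpy as np

def outer_sum(pots):
    """Array with entries sum_i pots[i][x_i] on the product grid."""
    m = len(pots)
    total = np.zeros(tuple(len(p) for p in pots))
    for i, p in enumerate(pots):
        shape = [1] * m
        shape[i] = len(p)
        total = total + p.reshape(shape)
    return total

def conjugate_component(cost, pots, i):
    """min over x_j, j != i, of c(x) - sum_{j != i} pots[j][x_j]."""
    others = [p if j != i else np.zeros_like(p) for j, p in enumerate(pots)]
    axes = tuple(a for a in range(cost.ndim) if a != i)
    return (cost - outer_sum(others)).min(axis=axes)

def convexify(cost, v):
    """Replace v_1, ..., v_m one at a time, as in the inductive definition of u_i."""
    u = [p.copy() for p in v]
    for i in range(len(v)):
        # At step i, entries j < i already hold u_j and entries j > i still hold v_j.
        u[i] = conjugate_component(cost, u, i)
    return u

# Hypothetical data: a random cost on a 4 x 3 x 5 grid and a feasible tuple v.
rng = np.random.default_rng(1)
cost = rng.random((4, 3, 5))
v = [rng.random(4), rng.random(3), rng.random(5)]
v[0] = v[0] - (outer_sum(v) - cost).max()  # shift so that sum_i v_i <= c
u = convexify(cost, v)
```

In this discrete setting one can then check numerically that $v_i \leq u_i$, that $\sum_i u_i(x_i) \leq c$, and that applying `conjugate_component` to `u` returns `u` again, which is the $c$-conjugacy of Definition 2.1.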

As a corollary to the duality theorem, we now prove a uniqueness result for the solution to D. When $m = 2$, this result, under the weak conditions on $c$ stated below, is due to Chiappori, McCann and Nesheim [8]; for certain special, multi-marginal costs, it was proven by Gangbo and Święch [9] and Carlier and Nazaret [5]. Although this result is tangential to the main goals of this article, we prove it here to emphasize that, whereas uniqueness in K requires certain structure conditions on the cost, uniqueness in D depends only on the differentiability of $c$.

Corollary 2.1. Suppose the domains $M_i$ are all connected, that $c$ is continuously differentiable and that each $\mu_i$ is absolutely continuous with respect to local coordinates with a strictly positive density. If $(v_1, v_2, ..., v_m)$ and $(\bar{v}_1, \bar{v}_2, ..., \bar{v}_m)$ solve D, then there exist constants $t_i$ for $i = 1, 2, ..., m$ such that $\sum_{i=1}^m t_i = 0$ and $v_i = \bar{v}_i + t_i$, $\mu_i$ almost everywhere, for all $i$.



Proof. Using the convexification trick in the proof of Theorem 2.1, we can find $c$-conjugate solutions $(u_1, u_2, ..., u_m)$ and $(\bar{u}_1, \bar{u}_2, ..., \bar{u}_m)$ to D such that $v_i(x_i) \leq u_i(x_i)$ and $\bar{v}_i(x_i) \leq \bar{u}_i(x_i)$ for all $x_i \in M_i$. Now, as

$$\sum_{i=1}^m \int_{M_i} v_i(x_i)\,d\mu_i = \sum_{i=1}^m \int_{M_i} u_i(x_i)\,d\mu_i$$

we must have $v_i = u_i$, $\mu_i$ almost everywhere. Similarly, $\bar{v}_i = \bar{u}_i$, $\mu_i$ almost everywhere. Now, choose $x_i \in M_i$ where $u_i$ and $\bar{u}_i$ are differentiable. Then there exists $x_j$ for all $j \neq i$ such that

$$(x_1, x_2, ..., x_{i-1}, x_i, x_{i+1}, ..., x_m) \in \mathrm{spt}(\mu).$$

Theorem 2.1 then yields

$$u_i(x_i) - c(x_1, x_2, ..., x_{i-1}, x_i, x_{i+1}, ..., x_m) = -\sum_{j \neq i} u_j(x_j).$$

Because

$$u_i(z_i) - c(x_1, x_2, ..., x_{i-1}, z_i, x_{i+1}, ..., x_m) \leq -\sum_{j \neq i} u_j(x_j)$$

for all other $z_i \in M_i$ we must have

$$Du_i(x_i) = D_{x_i}c(x_1, x_2, ..., x_{i-1}, x_i, x_{i+1}, ..., x_m).$$

Similarly,

$$D\bar{u}_i(x_i) = D_{x_i}c(x_1, x_2, ..., x_{i-1}, x_i, x_{i+1}, ..., x_m),$$

hence $Du_i(x_i) = D\bar{u}_i(x_i)$. As this equality holds for almost all $x_i$ we conclude $u_i(x_i) = \bar{u}_i(x_i) + t_i$ for some constant $t_i$. Choosing any $(x_1, x_2, ..., x_m) \in \mathrm{spt}(\mu)$ and noting that

$$\sum_{i=1}^m u_i(x_i) = c(x_1, x_2, ..., x_m) = \sum_{i=1}^m \bar{u}_i(x_i)$$

we obtain $\sum_{i=1}^m t_i = 0$. □

The next two definitions are straightforward generalizations of concepts 
borrowed from the two marginal setting. 

Definition 2.2. For $i \neq j$, we say that $c$ is $(i,j)$-twisted if the map $x_j \in M_j \mapsto D_{x_i}c(x_1, x_2, ..., x_m) \in T^*_{x_i}M_i$ is injective, for all fixed $x_k$, $k \neq j$.

Definition 2.3. We say that $c$ is $(i,j)$-non-degenerate if $D^2_{x_ix_j}c(x_1, x_2, ..., x_m)$, considered as a map from $T_{x_j}M_j$ to $T^*_{x_i}M_i$, is injective for all $(x_1, x_2, ..., x_m)$.

In local coordinates, non-degeneracy simply means that the corresponding matrix of mixed, second order partial derivatives has a non-zero determinant. When this condition holds, the inverse map $T^*_{x_i}M_i \to T_{x_j}M_j$ will be denoted by $(D^2_{x_ix_j}c)^{-1}(x_1, x_2, ..., x_m)$.

When $m = 2$, the non-degeneracy condition is not needed to ensure the existence of an optimal map (although it plays an important role in studying the regularity of that map). On the other hand, the twist condition plays an essential role in showing that Monge's problem has a solution; it ensures that a first order, differential condition arising from the duality theorem can be solved uniquely for one variable as a function of the other (see also [5] and [2]). In light of this, one might expect that, for $m \geq 3$, if $c$ is $(i,j)$-twisted for all $i \neq j$, then the Kantorovich solution $\mu$ induces a Monge solution. This is not true, as our example in [18] demonstrates. In the multi-marginal problem, duality yields $m$ first order conditions; our strategy in this paper is to show that if we fix the first variable, these equations can be uniquely solved for the other $m - 1$ variables. In the problems considered by Gangbo and Święch [9] and Heinich [10], these equations turn out to have a particularly simple form and can be solved explicitly. For more general cost functions, this becomes a much more subtle issue. Our proof will combine a second order, differential condition with tools from convex analysis and will require that the tensor $T$, defined below, is negative definite.

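Definition 2.4. Suppose $c$ is $(1,m)$-non-degenerate. Let $\vec{y} = (y_1, y_2, ..., y_m) \in M_1 \times M_2 \times ... \times M_m$. For each $i = 2, 3, ..., m-1$ choose a point $\vec{y}(i) = (y_1(i), y_2(i), ..., y_m(i)) \in \overline{M_1} \times \overline{M_2} \times ... \times \overline{M_m}$ such that $y_i(i) = y_i$. Define the following bi-linear maps on $T_{y_2}M_2 \times T_{y_3}M_3 \times ... \times T_{y_{m-1}}M_{m-1}$:

$$S_{\vec{y}} = -\sum_{j=2}^{m-1} \sum_{\substack{i=2 \\ i \neq j}}^{m-1} D^2_{x_ix_j}c(\vec{y}) + \sum_{i,j=2}^{m-1} \big( D^2_{x_ix_m}c\,(D^2_{x_1x_m}c)^{-1} D^2_{x_1x_j}c \big)(\vec{y})$$

$$H_{\vec{y},\vec{y}(2),\vec{y}(3),...,\vec{y}(m-1)} = \sum_{i=2}^{m-1} \big( \mathrm{Hess}_{x_i}c(\vec{y}(i)) - \mathrm{Hess}_{x_i}c(\vec{y}) \big)$$

$$T_{\vec{y},\vec{y}(2),\vec{y}(3),...,\vec{y}(m-1)} = S_{\vec{y}} + H_{\vec{y},\vec{y}(2),\vec{y}(3),...,\vec{y}(m-1)}$$

Note that $D^2_{x_ix_j}c(x_1, x_2, ..., x_m)$, $\mathrm{Hess}_{x_i}c(x_1, x_2, ..., x_m)$ and the composition $(D^2_{x_ix_m}c\,(D^2_{x_1x_m}c)^{-1} D^2_{x_1x_j}c)(x_1, x_2, ..., x_m)$ are actually bi-linear maps on the spaces $T_{x_i}M_i \times T_{x_j}M_j$, $T_{x_i}M_i \times T_{x_i}M_i$ and $T_{x_i}M_i \times T_{x_j}M_j$, respectively, but we can extend them to maps on the product space $(T_{x_2}M_2 \times T_{x_3}M_3 \times ... \times T_{x_{m-1}}M_{m-1})^2$ by considering only the appropriate components of the tangent vectors.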

Though $T$ looks complicated, it appears naturally in our argument. The condition $T < 0$ is in one sense analogous to the twist and non-degeneracy conditions that are so important in the two marginal problem. Like the non-degeneracy condition, negativity of $S$ is an inherently local property on $M_1 \times M_2 \times ... \times M_m$; under this condition, one can show that our system of equations is locally uniquely solvable. To show that the solution is actually globally unique requires something more; in the two marginal case, this is the twist condition, which can be seen as a global extension of non-degeneracy. In our setting, requiring that the sum $T = S + H < 0$ turns out to be enough to ensure that the locally unique solution is in fact globally unique.

3 Monge solutions 

We are now in a position to precisely state our main theorem: 
Theorem 3.1. Suppose that:

1. $c$ is $(1,m)$-non-degenerate.

2. $c$ is $(1,m)$-twisted.

3. For all choices of $\vec{y} = (y_1, y_2, ..., y_m) \in M_1 \times M_2 \times ... \times M_m$ and of $\vec{y}(i) = (y_1(i), y_2(i), ..., y_m(i)) \in \overline{M_1} \times \overline{M_2} \times ... \times \overline{M_m}$ such that $y_i(i) = y_i$ for $i = 2, ..., m-1$, we have

$$T_{\vec{y},\vec{y}(2),\vec{y}(3),...,\vec{y}(m-1)} < 0. \qquad (3)$$

4. The first marginal $\mu_1$ does not charge sets of Hausdorff dimension less than or equal to $n - 1$.

Then any solution $\mu$ to the Kantorovich problem is concentrated on the graph of a function; that is, there exist functions $G_i : M_1 \to M_i$ such that $\mathrm{graph}(G) = \{(x_1, G_2(x_1), G_3(x_1), ..., G_m(x_1))\}$ satisfies $\mu(\mathrm{graph}(G)) = 1$.



Proof. Let $u_i$ be a $c$-conjugate solution to the dual problem. Now, $u_1$ is semi-concave and hence differentiable off a set of Hausdorff dimension $n - 1$; as $\mu_1$ vanishes on every set of Hausdorff dimension less than or equal to $n - 1$, by Theorem 2.1 it suffices to show that for every $x_1 \in M_1$ where $u_1$ is differentiable, there is at most one $(x_2, x_3, ..., x_m) \in M_2 \times M_3 \times ... \times M_m$ such that $\sum_{i=1}^m u_i(x_i) = c(x_1, x_2, x_3, ..., x_m)$. Note that this equality implies that $D_{x_i}c(x_1, x_2, ..., x_m) \in \partial u_i(x_i)$ for all $i = 1, 2, ..., m$; in particular, as $u_1$ is differentiable at $x_1$, $Du_1(x_1) = D_{x_1}c(x_1, x_2, ..., x_m)$. Our strategy will be to show that these inclusions can hold for at most one $(x_2, x_3, ..., x_m)$.

Fix a point $x_1$ where $u_1$ is differentiable. Twistedness implies that the equation $Du_1(x_1) = D_{x_1}c(x_1, x_2, ..., x_m)$ defines $x_m$ as a function $x_m = F_{x_1}(x_2, ..., x_{m-1})$ of the variables $x_2, x_3, ..., x_{m-1}$; non-degeneracy and the implicit function theorem then imply that $F_{x_1}$ is continuously differentiable with respect to $x_2, x_3, ..., x_{m-1}$ and

$$D_{x_i}F_{x_1}(x_2, ..., x_{m-1}) = -\big( (D^2_{x_1x_m}c)^{-1} D^2_{x_1x_i}c \big)(x_1, x_2, ..., F_{x_1}(x_2, ..., x_{m-1}))$$

for $i = 2, ..., m-1$. We will show that there exists at most one point $(x_2, x_3, ..., x_{m-1}) \in M_2 \times M_3 \times ... \times M_{m-1}$ such that

$$D_{x_i}c(x_1, x_2, ..., F_{x_1}(x_2, ..., x_{m-1})) \in \partial u_i(x_i)$$

for all $i = 2, ..., m-1$.

The proof is by contradiction; suppose there are two such points, $(x_2, x_3, ..., x_{m-1})$ and $(\bar{x}_2, \bar{x}_3, ..., \bar{x}_{m-1})$. For $i = 2, ..., m-1$, we can choose Riemannian geodesics $\gamma_i(t)$ in $M_i$ such that $\gamma_i(0) = x_i$ and $\gamma_i(1) = \bar{x}_i$. Take a measurable selection of covectors $V_i(t) \in \partial u_i(\gamma_i(t))$. We will show that $f(1) < f(0)$, where

$$f(t) := \sum_{i=2}^{m-1} \big[ V_i(t) - D_{x_i}c(x_1, \vec{\gamma}(t)) \big] \Big\langle \frac{d\gamma_i}{dt} \Big\rangle$$

and we have used $(x_1, \vec{\gamma}(t))$ as a shorthand for $(x_1, \gamma_2(t), ..., \gamma_{m-1}(t), F_{x_1}(\gamma_2(t), ..., \gamma_{m-1}(t)))$ and $a\langle b \rangle$ to denote the duality pairing between a 1-form $a$ and a vector $b$. This will clearly imply the desired result.

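For each $t$ and each $i = 2, ..., m-1$, by $c$-conjugacy of $u_i$ and the compactness of $\overline{M_j}$ for $j \neq i$, we can choose $\vec{y}(i;t) = (y_1(i;t), y_2(i;t), ..., y_m(i;t)) \in \overline{M_1} \times \overline{M_2} \times ... \times \overline{M_m}$ so that $y_i(i;t) = \gamma_i(t)$ and

$$\sum_{j=1}^m u_j(y_j(i;t)) = c(y_1(i;t), y_2(i;t), ..., y_m(i;t)).$$

Note that $V_i(t)\langle \frac{d\gamma_i}{dt} \rangle$ supports the semi-concave function $t \in [0,1] \mapsto u_i(\gamma_i(t))$. But $u_i(\gamma_i(t))$ is twice differentiable almost everywhere and hence we have $V_i(t)\langle \frac{d\gamma_i}{dt} \rangle = \frac{d(u_i(\gamma_i(t)))}{dt}$ for almost all $t$ and, by semi-concavity, $V_i(1)\langle \frac{d\gamma_i}{dt}(1) \rangle - V_i(0)\langle \frac{d\gamma_i}{dt}(0) \rangle \leq \int_0^1 \frac{d^2(u_i(\gamma_i(t)))}{dt^2}\,dt$. Now, for any $t, s \in [0,1]$

$$u_i(\gamma_i(t)) \leq c(y_1(i;s), y_2(i;s), ..., y_{i-1}(i;s), \gamma_i(t), y_{i+1}(i;s), ..., y_m(i;s)) - \sum_{j \neq i} u_j(y_j(i;s))$$

and we have equality when $t = s$, as $\gamma_i(s) = y_i(i;s)$. Hence, whenever $\frac{d^2(u_i(\gamma_i(t)))}{dt^2}$ exists, we have

$$\frac{d^2(u_i(\gamma_i(t)))}{dt^2}\bigg|_{t=s} \leq \frac{d^2\big(c(y_1(i;s), y_2(i;s), ..., y_{i-1}(i;s), \gamma_i(t), y_{i+1}(i;s), ..., y_m(i;s))\big)}{dt^2}\bigg|_{t=s} = \mathrm{Hess}_{x_i}c(y_1(i;s), y_2(i;s), ..., y_m(i;s)) \Big\langle \frac{d\gamma_i}{ds}, \frac{d\gamma_i}{ds} \Big\rangle.$$

We conclude that

$$V_i(1)\Big\langle \frac{d\gamma_i}{dt}(1) \Big\rangle - V_i(0)\Big\langle \frac{d\gamma_i}{dt}(0) \Big\rangle \leq \int_0^1 \mathrm{Hess}_{x_i}c(y_1(i;t), y_2(i;t), ..., y_m(i;t)) \Big\langle \frac{d\gamma_i}{dt}, \frac{d\gamma_i}{dt} \Big\rangle\,dt. \qquad (4)$$

Turning now to the other term in $f(1) - f(0)$, we have

$$\sum_{i=2}^{m-1} \Big[ D_{x_i}c(x_1, \vec{\gamma}(1)) \Big\langle \frac{d\gamma_i}{dt}(1) \Big\rangle - D_{x_i}c(x_1, \vec{\gamma}(0)) \Big\langle \frac{d\gamma_i}{dt}(0) \Big\rangle \Big] = \int_0^1 \sum_{i=2}^{m-1} \frac{d}{dt} \Big( D_{x_i}c(x_1, \vec{\gamma}(t)) \Big\langle \frac{d\gamma_i}{dt} \Big\rangle \Big)\,dt$$

$$= \int_0^1 \Big[ \sum_{i=2}^{m-1} \mathrm{Hess}_{x_i}c(x_1, \vec{\gamma}(t)) \Big\langle \frac{d\gamma_i}{dt}, \frac{d\gamma_i}{dt} \Big\rangle + \sum_{\substack{i,j=2 \\ i \neq j}}^{m-1} D^2_{x_ix_j}c(x_1, \vec{\gamma}(t)) \Big\langle \frac{d\gamma_i}{dt}, \frac{d\gamma_j}{dt} \Big\rangle - \sum_{i,j=2}^{m-1} \big( D^2_{x_ix_m}c\,(D^2_{x_1x_m}c)^{-1} D^2_{x_1x_j}c \big)(x_1, \vec{\gamma}(t)) \Big\langle \frac{d\gamma_i}{dt}, \frac{d\gamma_j}{dt} \Big\rangle \Big]\,dt. \qquad (5)$$

Combining (4) and (5) yields

$$f(1) - f(0) \leq \int_0^1 T_{(x_1,\vec{\gamma}(t)),\vec{y}(2;t),\vec{y}(3;t),...,\vec{y}(m-1;t)} \Big\langle \frac{d\vec{\gamma}}{dt}, \frac{d\vec{\gamma}}{dt} \Big\rangle\,dt < 0.$$

□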

Corollary 3.2. Under the same conditions as Theorem 3.1, the Monge problem M admits a unique solution and the solution to the Kantorovich problem K is unique.

Proof. We first show that the $G_i$ defined in Theorem 3.1 push $\mu_1$ to $\mu_i$ for all $i = 2, 3, ..., m$. Pick a Borel set $B \subseteq M_i$. We have

$$\mu_i(B) = \mu(M_1 \times M_2 \times ... \times M_{i-1} \times B \times M_{i+1} \times ... \times M_m)$$
$$= \mu\big((M_1 \times M_2 \times ... \times M_{i-1} \times B \times M_{i+1} \times ... \times M_m) \cap \mathrm{graph}(G)\big)$$
$$= \mu\big(\{(x_1, G_2(x_1), ..., G_m(x_1)) \mid G_i(x_1) \in B\}\big)$$
$$= \mu\big((G_i^{-1}(B) \times M_2 \times ... \times M_m) \cap \mathrm{graph}(G)\big)$$
$$= \mu(G_i^{-1}(B) \times M_2 \times ... \times M_m)$$
$$= \mu_1(G_i^{-1}(B)).$$

This implies that $(G_2, G_3, ..., G_m)$ solves M. To prove uniqueness of $\mu$, note that any other optimizer $\bar{\mu}$ must also be concentrated on $\mathrm{graph}(G)$, which in turn implies $\bar{\mu} = (\mathrm{Id}, G_2, ..., G_m)_\#\mu_1 = \mu$. Uniqueness of $(G_2, G_3, ..., G_m)$ now follows immediately; if $(\bar{G}_2, \bar{G}_3, ..., \bar{G}_m)$ is another solution to M then $(\mathrm{Id}, \bar{G}_2, \bar{G}_3, ..., \bar{G}_m)_\#\mu_1$ is another solution to K, which must then be concentrated on $\mathrm{graph}(G)$. This means that $\bar{G}_i = G_i$, $\mu_1$ almost everywhere. □

4 Examples 

In this section, we discuss several types of cost functions to which Theorem 3.1 applies. In these examples, the complicated tensor $T$ simplifies considerably.

Example 4.1. (Perturbations of concave functions of the sum) Gangbo and Święch [9] and Heinich [10] treated cost functions defined on $(\mathbb{R}^n)^m$ by $c(x_1, x_2, ..., x_m) = h(\sum_{k=1}^m x_k)$ where $h : \mathbb{R}^n \to \mathbb{R}$ is strictly concave. Here, we make the slightly stronger assumption that $h$ is $C^2$ with $D^2h < 0$. Assuming each $\mu_i$ is compactly supported, we can take each $M_i$ to be a bounded, convex domain in $\mathbb{R}^n$. Now, $D_{x_i}c(x_1, x_2, ..., x_m) = Dh(\sum_{k=1}^m x_k)$ and $D^2_{x_ix_j}c(x_1, x_2, ..., x_m) = D^2h(\sum_{k=1}^m x_k)$, where we have made the obvious identification between tangent spaces at different points. $c$ is then clearly $(1,m)$-twisted and $(1,m)$-non-degenerate. Furthermore, the bi-linear map $S_{\vec{y}}$ on $(\mathbb{R}^n)^{m-2}$ is block diagonal, and each of its diagonal blocks is

$$D^2h\Big(\sum_{k=1}^m y_k\Big).$$

Similarly, as $\mathrm{Hess}_{x_i}c(\vec{y}(i)) = D^2h(\sum_{k=1}^m y_k(i))$ and $\mathrm{Hess}_{x_i}c(\vec{y}) = D^2h(\sum_{k=1}^m y_k)$, $H_{\vec{y},\vec{y}(2),\vec{y}(3),...,\vec{y}(m-1)}$ is block diagonal and its $i$th diagonal block is

$$D^2h\Big(\sum_{k=1}^m y_k(i)\Big) - D^2h\Big(\sum_{k=1}^m y_k\Big).$$

Therefore, $T_{\vec{y},\vec{y}(2),\vec{y}(3),...,\vec{y}(m-1)}$ is block diagonal and its $i$th diagonal block is

$$D^2h\Big(\sum_{k=1}^m y_k(i)\Big).$$

This is clearly negative definite. Furthermore, $C^2$ perturbations of this cost function will also satisfy $T_{\vec{y},\vec{y}(2),\vec{y}(3),...,\vec{y}(m-1)} < 0$; this shows that the results of Gangbo and Święch and Heinich are robust with respect to perturbations of the cost function.
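The block structure claimed in Example 4.1 can be checked numerically. The following is a minimal sketch, assuming the particular concave function $h(s) = -\sum_k \cosh(s_k)$ (so that $D^2h(s) = -\mathrm{diag}(\cosh s)$) and randomly chosen points; the helper names `hess_h` and `assemble_S` are hypothetical. It assembles $S$, $H$ and $T$ from Definition 2.4 as matrices, with the tangent spaces identified with $\mathbb{R}^n$.

```python
import numpy as np

def hess_h(s):
    """D^2 h(s) for the assumed choice h(s) = -sum_k cosh(s_k)."""
    return -np.diag(np.cosh(s))

def assemble_S(D, m, n):
    """S from Definition 2.4; D[(i, j)] is the n x n matrix of D^2_{x_i x_j} c (1-based i != j)."""
    idx = range(2, m)  # i, j = 2, ..., m-1
    S = np.zeros(((m - 2) * n, (m - 2) * n))
    inv_1m = np.linalg.inv(D[(1, m)])
    for a, i in enumerate(idx):
        for b, j in enumerate(idx):
            block = D[(i, m)] @ inv_1m @ D[(1, j)]
            if i != j:
                block = block - D[(i, j)]
            S[a * n:(a + 1) * n, b * n:(b + 1) * n] = block
    return S

m, n = 4, 2
rng = np.random.default_rng(2)
y = rng.normal(size=(m, n))          # the point y = (y_1, ..., y_m)
y_aux = {}
for i in range(2, m):                # points y(i) with y_i(i) = y_i
    pt = rng.normal(size=(m, n))
    pt[i - 1] = y[i - 1]
    y_aux[i] = pt

# For c(x) = h(sum_k x_k), every mixed second derivative equals D^2 h(sum_k y_k).
D = {(i, j): hess_h(y.sum(axis=0))
     for i in range(1, m + 1) for j in range(1, m + 1) if i != j}
S = assemble_S(D, m, n)
H = np.zeros_like(S)
for a, i in enumerate(range(2, m)):
    H[a * n:(a + 1) * n, a * n:(a + 1) * n] = (
        hess_h(y_aux[i].sum(axis=0)) - hess_h(y.sum(axis=0)))
T = S + H
```

Under these assumptions `T` comes out block diagonal with $i$th block $D^2h(\sum_k y_k(i))$, and all eigenvalues of `T` are negative.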

Example 4.2. (Bi-linear costs) We now turn to bi-linear costs; suppose $c : (\mathbb{R}^n)^m \to \mathbb{R}$ is given by $c(x_1, x_2, ..., x_m) = \sum_{i \neq j} (x_i)^T A_{ij} x_j$ for $n$ by $n$ matrices $A_{ij}$. If $A_{1m}$ is non-singular, $c$ is $(1,m)$-twisted and $(1,m)$-non-degenerate. Now, the Hessian terms in $T$ vanish and so the condition $T < 0$ becomes a condition on the $A_{ij}$. For example, when $m = 3$, we have $T = A_{21}(A_{31})^{-1}A_{32}$; $T < 0$ is the same condition that ensures the solution to K is contained in an $n$ dimensional submanifold in [18].

Note that after changing coordinates in $x_2$ and $x_3$, we can assume any bi-linear three-marginal cost is of the form

$$c(x_1, x_2, x_3) = x_1 \cdot x_2 + x_1 \cdot x_3 + x_2^T A x_3.$$

In these coordinates, the threefold product $A_{21}(A_{31})^{-1}A_{32} = A^T$. Applying the linear change of coordinates

$$x_1 \mapsto U_1x_1, \qquad x_2 \mapsto U_2x_2, \qquad x_3 \mapsto U_3x_3$$

yields

$$c(x_1, x_2, x_3) = x_1^T U_1^T U_2 x_2 + x_1^T U_1^T U_3 x_3 + x_2^T U_2^T A U_3 x_3.$$

If $A$ is negative definite and symmetric, then we can choose $U_3 = U_2$ such that $U_2^T A U_3 = -I$ and $U_1 = -(U_2^T)^{-1}$ to obtain

$$c(x_1, x_2, x_3) = -x_1^T x_2 - x_1^T x_3 - x_2^T x_3$$

which is equivalent² to the cost of Gangbo and Święch. As the symmetry of $D^2_{x_2x_1}c\,(D^2_{x_3x_1}c)^{-1}D^2_{x_3x_2}c$ is independent of our choice of coordinates, we conclude that $c$ is equivalent to Gangbo and Święch's cost if and only if $A_{21}(A_{31})^{-1}A_{32}$ is symmetric and negative definite. Thus, when $m = 3$ our result restricted to bi-linear costs generalizes Gangbo and Święch's theorem from costs for which $A_{21}(A_{31})^{-1}A_{32}$ is symmetric and negative definite to ones for which it is only negative definite.
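The change of coordinates in Example 4.2 can be verified numerically. The following is a minimal sketch, assuming a randomly generated symmetric negative definite $A$ and obtaining $U_2$ from a Cholesky factorization of $-A$ (one possible choice, not necessarily the one the author has in mind); the helper `is_negative_definite` is hypothetical. It also shows a non-symmetric matrix that is still negative definite, the case covered by Theorem 3.1 but not by the symmetric normal form.

```python
import numpy as np

def is_negative_definite(B):
    """v^T B v < 0 for all v != 0 iff the symmetric part of B has only negative eigenvalues."""
    return bool(np.all(np.linalg.eigvalsh((B + B.T) / 2) < 0))

rng = np.random.default_rng(3)
n = 3
R = rng.normal(size=(n, n))
A = -(R @ R.T + np.eye(n))        # symmetric and negative definite

# In the normalized coordinates A_21 = A_31 = I and A_32 = A^T, so T = A^T.
T = np.eye(n) @ np.linalg.inv(np.eye(n)) @ A.T

# Choose U_3 = U_2 with U_2^T A U_3 = -I, using -A = L L^T.
L = np.linalg.cholesky(-A)
U2 = np.linalg.inv(L).T
U3 = U2
U1 = -np.linalg.inv(U2.T)

# A non-symmetric example: adding a skew part leaves v^T B v unchanged.
A_nonsym = A + (R - R.T)
```

With these choices the three coefficient matrices $U_1^TU_2$, $U_1^TU_3$ and $U_2^TAU_3$ all equal $-I$, while `A_nonsym` is negative definite without being symmetric.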

Example 4.3. There is another class of three marginal problems which Theorem 3.1 applies to: on $\mathbb{R}^n \times \mathbb{R}^n \times \mathbb{R}^n$, set

$$c(x_1, x_2, x_3) = g(x_1, x_3) + \frac{|x_1 - x_2|^2}{2} + \frac{|x_3 - x_2|^2}{2}.$$

If $g(x_1, x_3) = \frac{|x_1 - x_3|^2}{2}$, this is equivalent to the cost of Gangbo and Święch. More generally, if $g$ is $(1,3)$-twisted and non-degenerate, then $c$ is as well. Moreover, if we make the usual identification between tangent spaces at different points in $\mathbb{R}^n$, we have

$$T_{\vec{y},\vec{y}(2)} = \big( D^2_{x_1x_3}g(y_1, y_3) \big)^{-1}.$$

Hence, if $D^2_{x_1x_3}g(y_1, y_3) < 0$, we have $T_{\vec{y},\vec{y}(2)} < 0$. This will be the case if, for example, $g(x_1, x_3) = h(x_1 - x_3)$ for $h$ uniformly convex or $g(x_1, x_3) = h(x_1 + x_3)$ for $h$ uniformly concave.

² We say cost functions $c$ and $\bar{c}$ are equivalent if $\bar{c}(x_1, x_2, ..., x_m) = c(x_1, x_2, ..., x_m) + \sum_{i=1}^m g_i(x_i)$. As the effect of the $g_i$'s is to shift the functionals $C(G_2, G_3, ..., G_m)$ and $C(\mu)$ by the constant $\sum_{i=1}^m \int_{M_i} g_i(x_i)\,d\mu_i$, studying $c$ is essentially equivalent to studying $\bar{c}$.

Example 4.4. (Hedonic Pricing) Chiappori, McCann and Nesheim [8] and Carlier and Ekeland showed that finding equilibrium in a certain hedonic pricing model is equivalent to solving a multi-marginal optimal transportation problem with a cost function of the form $c(x_1, x_2, ..., x_m) = \inf_{z \in Z} \sum_{i=1}^m f_i(x_i, z)$. Let us assume:

1. $Z$ is a $C^2$ smooth $n$-dimensional manifold.

2. For all $i$, $f_i$ is $C^2$ and non-degenerate.

3. For each $(x_1, x_2, ..., x_m)$ the infimum is attained by a unique $z(x_1, x_2, ..., x_m) \in Z$ and

4. $\sum_{i=1}^m D^2_{zz}f_i(x_i, z(x_1, x_2, ..., x_m))$ is non-singular.

In [18], we showed that these conditions implied that $c$ is $C^2$ and $(i,j)$-non-degenerate for all $i \neq j$; we then showed that the support of any optimizer is contained in an $n$-dimensional Lipschitz submanifold of the product $M_1 \times M_2 \times ... \times M_m$. Here we examine conditions on the $f_i$ that ensure the hypotheses of Theorem 3.1 are satisfied. If, for fixed $i \neq j$, we assume in addition that:

5. $f_i$ is $x_i, z$ twisted (that is, $z \mapsto D_{x_i}f_i(x_i, z)$ is injective) and

6. $f_j$ is $z, x_j$ twisted.

then $c$ is $(i,j)$-twisted. Indeed, note that $c(x_1, x_2, ..., x_m) \leq \sum_{i=1}^m f_i(x_i, z)$ with equality when $z = z(x_1, x_2, ..., x_m)$; therefore,

$$D_{x_i}c(x_1, x_2, ..., x_m) = D_{x_i}f_i(x_i, z(x_1, x_2, ..., x_m)). \qquad (6)$$

Therefore, for fixed $x_k$ for all $k \neq j$, the map $x_j \mapsto D_{x_i}c(x_1, x_2, ..., x_m)$ is the composition of the maps $x_j \mapsto z(x_1, x_2, ..., x_m)$ and $z \mapsto D_{x_i}f_i(x_i, z)$. The latter map is injective by assumption. Now, note that

$$\sum_{k=1}^m D_zf_k(x_k, z(x_1, x_2, ..., x_m)) = 0;$$

hence,

$$D_zf_j(x_j, z(x_1, x_2, ..., x_m)) = -\sum_{k \neq j} D_zf_k(x_k, z(x_1, x_2, ..., x_m)).$$

Twistedness of $f_j$ now immediately implies injectivity of the first map.

We now investigate the form of the tensor T. 

As $A(x_1, x_2, ..., x_m) := \sum_{i=1}^m D^2_{zz}f_i(x_i, z(x_1, x_2, ..., x_m))$ is non-singular by assumption, the implicit function theorem implies that $z(x_1, x_2, ..., x_m)$ is differentiable and

$$D_{x_i}z(x_1, x_2, ..., x_m) = -\big(A(x_1, x_2, ..., x_m)\big)^{-1} D^2_{zx_i}f_i(x_i, z(x_1, x_2, ..., x_m)).$$

Furthermore, note that as $A$ is positive semi-definite by the minimality of $z \mapsto \sum_{i=1}^m f_i(x_i, z)$ at $z(x_1, x_2, ..., x_m)$, the non-singular assumption implies that it is in fact positive definite.
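The formula for $D_{x_i}z$ can be sanity-checked on a case where the minimizer is known in closed form. The following is a minimal sketch, assuming the quadratic choice $f_i(x_i, z) = \frac{1}{2}(x_i - z)^T Q_i (x_i - z)$ with symmetric positive definite $Q_i$ (an assumption made only for this illustration, with $Z = \mathbb{R}^n$); the helper names are hypothetical. For this choice $D^2_{zz}f_i = Q_i$, $D^2_{zx_i}f_i = -Q_i$ and $z(x_1, ..., x_m) = A^{-1}\sum_i Q_i x_i$ with $A = \sum_i Q_i$.

```python
import numpy as np

def spd(rng, n):
    """A random symmetric positive definite n x n matrix."""
    R = rng.normal(size=(n, n))
    return R @ R.T + np.eye(n)

def minimizer(Q, x):
    """z(x_1, ..., x_m) for f_i(x_i, z) = 0.5 (x_i - z)^T Q_i (x_i - z)."""
    A = sum(Q)
    return np.linalg.solve(A, sum(Qi @ xi for Qi, xi in zip(Q, x)))

def jacobian_formula(Q, i):
    """-A^{-1} D^2_{z x_i} f_i, using D^2_{z x_i} f_i = -Q_i for this quadratic f_i."""
    A = sum(Q)
    return -np.linalg.solve(A, -Q[i])

rng = np.random.default_rng(4)
m, n = 3, 2
Q = [spd(rng, n) for _ in range(m)]
x = [rng.normal(size=n) for _ in range(m)]

# Central finite differences of z with respect to x_i (0-based index i).
i, eps = 1, 1e-6
fd = np.zeros((n, n))
for k in range(n):
    step = np.zeros(n)
    step[k] = eps
    x_plus = [xj + step if j == i else xj for j, xj in enumerate(x)]
    x_minus = [xj - step if j == i else xj for j, xj in enumerate(x)]
    fd[:, k] = (minimizer(Q, x_plus) - minimizer(Q, x_minus)) / (2 * eps)
```

In this quadratic setting the finite-difference Jacobian `fd` agrees with `jacobian_formula(Q, i)`, and $A = \sum_i Q_i$ is positive definite, in line with the remark on minimality.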

Differentiating (6) with respect to $x_i$ for $i = 2, 3, ..., m-1$ yields:

$$\mathrm{Hess}_{x_i}c = (D^2_{x_iz}f_i)D_{x_i}z + \mathrm{Hess}_{x_i}f_i = -(D^2_{x_iz}f_i)A^{-1}(D^2_{zx_i}f_i) + \mathrm{Hess}_{x_i}f_i,$$

where we have suppressed the arguments $x_1, x_2, ..., x_m$ and $z(x_1, x_2, ..., x_m)$. A similar calculation yields, for all $i \neq j$,

$$D^2_{x_ix_j}c = (D^2_{x_iz}f_i)D_{x_j}z = -(D^2_{x_iz}f_i)A^{-1}(D^2_{zx_j}f_j).$$

Thus, for all $i \neq j$, a straightforward calculation yields

$$D^2_{x_ix_m}c\,(D^2_{x_1x_m}c)^{-1}D^2_{x_1x_j}c = -(D^2_{x_iz}f_i)A^{-1}(D^2_{zx_j}f_j) = D^2_{x_ix_j}c.$$

Hence, $S_{\vec{y}}$ is block diagonal. Furthermore, another simple calculation implies that its $i$th diagonal block is

$$\big[ D^2_{x_ix_m}c\,(D^2_{x_1x_m}c)^{-1}D^2_{x_1x_i}c \big](\vec{y}) = -\big[ (D^2_{x_iz}f_i)A^{-1}(D^2_{zx_i}f_i) \big](\vec{y}, z(\vec{y})).$$

In addition, $H_{\vec{y},\vec{y}(2),\vec{y}(3),...,\vec{y}(m-1)}$ is block diagonal and its $i$th block is

$$-\big[ (D^2_{x_iz}f_i)A^{-1}(D^2_{zx_i}f_i) \big](\vec{y}(i), z(\vec{y}(i))) + \mathrm{Hess}_{x_i}f_i(y_i, z(\vec{y}(i))) + \big[ (D^2_{x_iz}f_i)A^{-1}(D^2_{zx_i}f_i) \big](\vec{y}, z(\vec{y})) - \mathrm{Hess}_{x_i}f_i(y_i, z(\vec{y})).$$

Hence, $T_{\vec{y},\vec{y}(2),\vec{y}(3),...,\vec{y}(m-1)}$ is block diagonal and its $i$th block is

$$-\big[ (D^2_{x_iz}f_i)A^{-1}(D^2_{zx_i}f_i) \big](\vec{y}(i), z(\vec{y}(i))) + \mathrm{Hess}_{x_i}f_i(y_i, z(\vec{y}(i))) - \mathrm{Hess}_{x_i}f_i(y_i, z(\vec{y})). \qquad (7)$$

Therefore, $T_{\vec{y},\vec{y}(2),\vec{y}(3),...,\vec{y}(m-1)}$ is negative definite if and only if each of its diagonal blocks is. Now, $A$ is symmetric and positive definite; therefore $A^{-1}$ is as well. The first term in the $i$th block of (7) is therefore negative definite; the entire block will be negative definite if this term dominates the difference of the Hessian terms. This is the case if, for example, $M_i = \mathbb{R}^n$ and $f_i$ takes the form $f_i(x_i, z) = x_i\alpha_i(z) + \beta_i(x_i) + \lambda_i(z)$ for all $i = 2, 3, ..., m-1$, in which case $\mathrm{Hess}_{x_i}f_i(y_i, z(\vec{y}(i))) = \mathrm{Hess}_{x_i}f_i(y_i, z($